IAES International Journal of Artificial Intelligence (IJ-AI) 
Vol. 9, No. 3, September 2020, pp. 424~428 
ISSN: 2252-8938, DOI: 10.1159 1/ijai.v9.i3.pp424-428 o 424 


Age-based facial recognition using convoluted neural network 


deep learning algorithm 


Julius Yong Wu Jien’, Aslina Baharum/’, Shaliza Hayati A. Wahab?, Nordin Saad*, Muhammad 


Omar*, Noorsidi Aizuddin Mat Noor® 


'.2.3,4U ser Experience Research Lab (UXRL), Faculty of Computing and Informatics, Universiti Malaysia Sabah, 88400 


Kota Kinabalu, Sabah, Malaysia 


Faculty of Business Management, Universiti Teknologi MARA, Sarawak Campus, Malaysia 
SUTM CRES, Faculty of Built Environment and Surveying, Universiti Teknologi Malaysia, Johor Bahru, Johor, Malaysia 


Article Info 


ABSTRACT 


Article history: 


Received Feb 5, 2020 
Revised Apr 12, 2020 
Accepted Apr 27, 2020 


Keywords: 


Age detection 
Convolutional neural network 
Image processing 


Face recognition is the use of biometric innovations that can see or validate a 
person by seeing and investigating designs depending on the shape of the 
individual. Face recognition is used largely for the purpose of well-being, 
despite the fact that passion for different areas of use is growing. Overall, 
face recognition innovations are worth considering because they have the 
potential for broad legal jurisdiction and different business applications. It is 
widely used in many spaces. How it works is a product of facial recognition 
processing facial geometry. The hole between the ear and the good way from 
the front to the jaw are the main variables. This code distinguishes the 
highlight of the face that is important for your facial separation and creates 
your facial expression. Therefore, this study gives an overview of age 
detection using a different combination of machine learning and image 


Prediction processing methods on the image dataset. 
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1. INTRODUCTION 

Facial recognition (FR) is utilized for the most part for wellbeing purposes, despite the fact that 
enthusiasm for different territories of utilization is developing [1]. In all actuality, FR innovation has gotten 
noteworthy consideration as it has the potential for a wide scope of law authorization and different business 
applications [2]. It has been generally utilized in numerous spaces, for example, ATM, social insurance 
framework, driving permit framework, train reservation framework, observing assistance, and identification 
verification. How it functions is the product for facial acknowledgment peruses the face's geometry. The hole 
between the ears and the good ways from the front to the jaw are key variables [3]. The code distinguishes 
facial highlights that are crucial to your face separation and produce your facial mark. Because of deep 
learning techniques, there have been critical advances in FR [4]. In the early stages, exploring the interest, for 
the most part, focused on FR with a deep system of significant light or picture faces. Stephen [5] given a brief 
review of the techniques of deep learning and face-to-face learning and compares some of the basic neural 
formulas based on common convolution neural networks (CNNs). The deep networks used in FR, such as 
deep belief network (DBN), convolutional neural network (CNN, or ConvNet), autoencoder (AE), and others 
are analyzed for architecture [6]. Mandal [7] assessed a significant measure of profound learning strategies 
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for FR. Sepas-Moghaddam [6] studied of FR arrangements dependent on another, all the more incorporating 
and more extravagant staggered scientific classification. Learned-Miller [8] looked at a variety of surprising 
inventive strategies in the Labeled Faces in the Wild (LFW) database. 


2. RESEARCH METHOD 
Figure 1 shows the flow of methodology. This flow consists of phase 1 until phase 3 as below: 

— Phase 1 
In this phase, data is acquired. In the data preparation stage, samples of facial images are acquired and 
then undergo pre-processing to enhance the quality of images. 

— Phase 2 
In this phase, segmented images is then undergone feature extraction process here. These extracted 
features are then used in the training process. 

— Phase 3 

— The final phase is the prediction and evaluation stage where each built model is used to predict the 
input image. The accuracy of each model will be calculated and evaluated. 


Application orr tools 
Data Aquisition to extract data eg. 
camera/phone 


Denoising 


Feature Extraction 


Training Set 


Model 


Result 


Figure 1. Flow of methodology 
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3. RESULTS AND DISCUSSION 
3.1. Data acquisition 

Three databases were utilized in the examination: the BERC database, the PAL maturing database, 
and the FG-Net maturing database [9]. The BERC database contains the face pictures of 390 people the age 
extend 3 to 81 years of age. The facial pictures were gotten utilizing a computerized camera at a high 
resolution of 160 0 x120 0 pixels. The FG-NET developing database is used in this investigation, which 
contains 1002 face pictures from 82 one of a kind subjects, and each subject has 6—18 face pictures named 
with ground truth ages. The ages are flowed in a wide range from 0 to 69. The age transport in either the 
number of pictures or the amount of subjects is especially disproportionate [10-11]. 


3.2. Image denoising 

Numerically, picture commotion is portrayed as a multi-dimensional stochastic process [12]. 
Therefore, picture commotion can be numerically portrayed by scientific insights, for example 
through likelihood thickness circulation work [13]. The presentation of a scientific model can accomplish 
better denoising. 


Noise Model 
(1) Rayleigh noise 


2 (a) z2a 
P(z) = pe ae z<a 
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Condition | is the likelihood thickness appropriation capacity of Rayleigh commotion. When the gray value 
is more prominent than or equivalent to the likelihood thickness bend grades to one side and the fundamental 
region on the correct side is bigger, that is, the dim estimation of commotion focuses is more distributed in 
the right side of the central axis a + V(b/2) 


(2) Gaussian noise 
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Condition 2 is the likelihood thickness appropriation capacity of the Gaussian commotion. The gray value of 
the clamor is around the focal grayscale, which is generally dark, and the high contrast commotion 
circulation is less [14]. Gaussian commotion is likewise called ordinary noise. Its likelihood thickness 
complies with typical circulation. It is a generally utilized commotion model. Gaussian noise is normally 
brought about by awful lighting or high temperature during procurement [15]. 


3.3. Feature extraction 

Feature extraction is a strategy of dimensionality decline by which a fundamental game plan of 
rough data is reduced to logically sensible social affairs for taking care of [16]. Nature of these tremendous 
enlightening assortments is innumerable elements that require a huge amount of enrolling advantages for the 
process [17]. Feature extraction is the name for methods that select and additionally merge factors into 
features, effectively decreasing the proportion of data that must be taken care of, while still absolutely and 
thoroughly depicting the primary instructive assortment [18-19]. 

In this study, the highlights can be separated by utilizing CNN. Utilizing locale-based CNN finding 
key positions, making a sliding window on the picture, and moving the sliding window along the picture to 
get the potential objective zone, CNN is utilized to remove the standard highlights of the objective region, 
that is, to get the yield of fixed measurements as per convolution, pooling and different tasks. At that point, 
the yield vectors of the subsequent stage are grouped (classifiers should be prepared by their highlights); 
and the facial age is anticipated utilizing the CNN model. 
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3.4. Training set 
3.4.1. Convolutional neural network structure 

The classical network structure of convolutional neural system appears in Figure 2. The convolution 
neural system contains a few "convolution layers" and "examining layers" to process the information signal. 
At that point, the mapping between the info signal and the yield result is acknowledged in the full 
association layer. Every convolution extricates the highlights of the information signal through a convolution 
activity of a convolution filter [20-22]. Examining layer is likewise called the "assembly" layer. Its capacity 
is to utilize the standard of neighborhood relationship to down example, which lessens information 
(decreases calculation) and holds valuable data in the system [23]. 

After the picture goes through all convolution layers and testing layers, the element mapping is 
normally changed over into highlight vector yield by full association activity, as it were, the full association 
layer is framed. 

The element mapping can be associated persistently for commonly, and the last full association 
layer is the yield layer. By and by, the yield layer is frequently utilized as the element of picture extraction, 
and these highlights are generally utilized for relapse characterization preparing (for preparing SVM or 
Soft Max) [24-25]. 


Convolution Pooling Convolution Pooling Fully Fully Output 
+ReLU +ReLU Connected Connected perdictions 


dog (0.01) 


Cat (0.01) 
Boat (0.94) 
Bird (0.94) 


Figure 2. The classical structure of CNN 


4. CONCLUSION 

It tends to be anticipated that the programmed facial age evaluation strategy dependent on CNN has 
preferred execution over that dependent on artificial features of SVM. The explanation might be that the 
CNN can procure progressively inexhaustible and significant highlights in facial pictures by learning, 
while the falsely planned highlights can just cover some fixed and single highlights in the picture, 
and the demeanor of highlights is not rich. Overall, this study aims to give an overview of the improvement 
of the accuracy in detecting the age of facial recognition. Besides, facial images datasets that were proposed 
to use in this paper for further study. Facial images from a different region in the world such as CXR from 
Africa or from the western country can be applied too. Furthermore, the total number of datasets use as 
training and validation suggested to increase as it might improve overall performance. 
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